Inducing Cost-Sensitive Non-Linear Decision Trees

نویسنده

  • Sunil Vadera
چکیده

This paper presents a new decision tree learning algorithm that takes account of costs of misclassification. The algorithm is based on the hypothesis that non-linear decision nodes provide a better basis for cost-sensitive induction than axis-parallel decision nodes and utilizes discriminant analysis to construct non-linear cost-sensitive decision trees. The performance of the algorithm is evaluated by applying it to seven data sets and the results compared with those obtained by two well known cost-sensitive algorithms, ICET and MetaCost. MetaCost is applied both with a base learner that uses axis-parallel spits and a base learner that uses non-linear splits, thereby enabling an evaluation of the value of non-linear decision nodes. The results show that the new algorithm displays a better profile than ICET as the ratio of costs of misclassification departs from unity and that it provides a better base learner for MetaCost than an axis-parallel learner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Goal-Directed Classification Using Linear Machine Decision Trees

Recent work in feature-based classi cation has focused on non-parametric techniques that can classify instances even when the underlying feature distributions are unknown. The inference algorithms for training these techniques, however, are designed to maximize the accuracy of the classi er, with all errors weighted equally. In many applications, certain errors are far more costly than others, ...

متن کامل

Inducing safer oblique trees without costs

Decision tree induction has been has been widely studied and applied. In safety applications, such as determining whether a chemical process is safe, or whether a person has a medical condition, the cost of misclassification in one of the classes is significantly higher than the other class. Several authors have tackled this problem by developing cost-sensitive decision tree learning algorithms...

متن کامل

Cost-sensitive Decision Trees with Post-pruning and Competition for Numeric Data

Decision tree is an effective classification approach in data mining and machine learning. In some applications, test costs and misclassification costs should be considered while inducing decision trees. Recently, some cost-sensitive learning algorithms based on ID3, such as CS-ID3, IDX, ICET and λ-ID3, have been proposed to deal with the issue. In this paper, we develop a decision tree algorit...

متن کامل

Constructing Cost Sensitive Decision Trees Based on Multi-Objective Optimization

We propose a multi-objective optimization based on the cost sensitive decision tree building method. The misclassification cost, test cost, waiting time cost and information gain rate as four optimization goals by using the method of linear weighting are adopted to transfer the multiobjective optimization problem into a single objective optimization problem, as the splitting attribute selection...

متن کامل

Cost-Sensitive Decision Trees with Pre-pruning

This paper explores two simple and efficient pre-pruning strategies for the cost-sensitive decision tree algorithm to avoid overfitting. One is to limit the cost-sensitive decision trees to a depth of two. The other is to prune the trees with a pre-specified threshold. Empirical study shows that, compared to the error-based tree algorithm C4.5 and several other cost-sensitive tree algorithms, t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005